Verb Classification – Machine Learning Experiments in Classifying Verbs into Semantic Classes

نویسندگان

  • Bart Decadt
  • Walter Daelemans
چکیده

This paper presents the results of our machine learning experiments in verb classification. Using Beth Levin’s semantic classification of the English verbs as a gold standard, we (i) test the hypothesis that the syntactic behavior of a verb can be used to predict its semantic class, and (ii) investigate whether a robust shallow parser can provide the necessary syntactic information. With 277 verbs belonging to six of Levin’s classes, we do type classification experiments using RIPPER, an inductive rule learner. Having only a set of n most likely subjects or objects as features, this machine learning algorithm is able to predict the correct class with ± 58% accuracy. This result is comparable with results from other researchers, like Merlo and Stevenson, Stevenson and Joanis, and Schulte im Walde.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

رشد جنبه معنایی فعل در کودک فارسی‌زبان: مطالعه طولی

Objective Learning “verb” as one of the main components of sentence, has been always a debatable topics in the process of language learning. One of the important issues in “verb” learning is determining its meaning using syntactic clues and learning its semantic aspects. Therefore, the main objective of this study was to examine the development of the semantic aspect of ...

متن کامل

Computational Lexicography and Lexicology A Large-Scale Extension of VerbNet with Novel Verb Classes

Lexical classifications have proved useful in supporting various linguistic and natural language processing (NLP) tasks. The largest verb classification in English is Levin's (1993) work. VerbNet (Kipper-Schuler 2006) the largest computational verb lexicon currently available for English provides detailed syntactic-semantic descriptions of Levin classes. While the classes included are extensive...

متن کامل

Automatic Verb Classification Using Distributions of Grammatical Features

We apply machine learning techniques to classify automatically a set of verbs into lexical semantic classes, based on distributional approximations of diathe-ses, extracted from a very large annotated corpus. Distributions of four grammatical features are sufficient to reduce error rate by 50% over chance. We conclude that corpus data is a usable repository of verb class information, and that c...

متن کامل

Classifying Arabic Verbs Using Sibling Classes

In the effort of building a verb lexicon classifying the most used verbs in Arabic and providing information about their syntax and semantics (Mousser, 2010), the problem of classes over-generation arises because of the overt morphology of Arabic, which codes not only agreement and inflection relations but also semantic information related to thematic arity or other semantic information like ”i...

متن کامل

Supervised Learning of a Probabilistic Lexicon of Verb Semantic Classes

The work presented in this paper explores a supervised method for learning a probabilistic model of a lexicon of VerbNet classes. We intend for the probabilistic model to provide a probability distribution of verb-class associations, over known and unknown verbs, including polysemous words. In our approach, training instances are obtained from an existing lexicon and/or from an annotated corpus...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004